-1, Describe in detaii what theWmponents of tiie invention are an^ow the iSJStS^S?^ 
INVENTION 

JfJm Jiff "^rSfif P^"®'^*®^ S/^P^'cs are represented by a finite number of screen pixels, tfiere will be some visual 
anornal.es called the jaggies or staircasing. a phenomenon known as aliasing. The application of techniques that reduce 
a asmg .s called anting, of which full-scene anti-aliasing via supersampling is one of these tSSies OlheranT 
aliasing methods include sub-pixel computations, edge blending, and color accumulation. 

fcHSf"^" ® f J""'! ^PP"'^*^^ to full-scene anti-aliasing in which the original scene is rendered at a higher 
resolution and then firtered down to the onginal screen resolution. This in effect raises the Nyquist limit, which simolv shifts 
rJetSS'k!if;°' V'' -^i ^^«q"«"°y- Though the technique does not eliminate aliasing compteShe 

Tr^wS^Jfc^ "'^ W^CLgraBhicsacce!ei»tors today. However, there are sorSe perfomiance 

drawbacks when this technique is employed, namely the extra processing, and memory storage and bandSh reauired to 

S^Son^/Hf i '^!- down'supersamplKg twoLeslnlJroTx and y 

directions (/^=4) results in four time of processing, storage, and bandwidth. This patent describes an efficient 
2lE ?"Peraampling without incurring extra memory storage and bandwidth by using a tile^jased rendering 

architecture in conjunction with a unified graphics cache. "oow renaenng 

llfh'It''®®®^*^"? '^?^ architecture with a unified graphics cache model, the color anririApth»ai..oc for pixels inside 

■ ^ly*" !? ? °^ """"^^ (Pgu rel). lhetilesS fede{ei!mSS on 

itnSm^'S ''^'J? *°""^??o'?S^ unifiedVWcs cache! For initance a 64KB unified graphics^elan 

f ^'3^ °* ^l^^^ P"*®'"' P''*®' "'"^'s*^ °^ 32bit color and depth values. Besides color and depth 
data, the unified graphics cache also stores the texture data. 
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Figure 1: A L2 Sharing Model in an integrated microprocessor using tile-based 3D rendering architecture 

tn"i?.HHf '® 3* ^ "The -Graphics Color/Z" cache is large enough 

to fulf II intemriediate color and 2 data accesses for all triangles that fall inside the tile. The color and 2 data are written 

ffflf/ .T® '"if®' rendering. All pixels in the tile are then considered 

f„ fl?^ "^IT '^/^"^ered again. The purpose of this invention is to exploit the benefit of a tile-based architecture 
in conjunction with the micro-architectural features of a unified graphics cache to perfomi supersampling efficiently. 

The patent covers efficient implementation of supersampling by eliminating the extra memory storage and bandwidth 
requirements using the unified graphics cache model. Figure 2 below describes the data flow when supersampling is 
enabled. For simplicity, a value of /c= 4 is assumed by supersampling 2* in both X and Y directions. The technique is 
equally applicable to any other /rvalues. Additionally, the "physicar tile size is assumed to be 128x64. and polygons are 
software-binned into a "virtual" tile size of 64x32. 
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-2£-Da<^ Flow Diagram for Efficient Supersampling Using Tile*Based Rendering with a Unified Graphics Cache 



1. Polygons, already binned into tiles, are received in their original forms into the graphics core, but are internally amplified 
4x of Its original size. This is achieved through the viewport transformation supported by the graphics setup engine. 

2. The enlarged polygons are tile-based rendered into the "GFX C/Z Tile Buffer" in the unified graphics cache (red line 
#1). Texture data for polygons being rendered are accessed from the "GFX Texture Cache** If hits occur to the cache 
(pink tine #2). 

3. After the last triangle in the tile finishes rendering, the "GFX C/Z Tile Buffer" contains the complete Image of the tile that 
Is 4x of its original tile size. 

4. A stretch BLT is performed to down sample the Image from the virtual tile down to the physical tile size. This Is 
accomplished by rendering a rectangle (made up of two polygons) of the size equal to that of the physical tile size. The 
supersampled image in the virtual tile (still stored In the "GFX C/Z Tile Buffei^) is considered the source of the stretch 
BLT, while the destination Is allocated In the extemal memory. Intemally, the micro-architecture treats the source of 
the stretch BLT as a texturie map for the destination rectangle (blue line #3). The "GFX Texture Cache" is kept 
undisturbed to maintain good utilization of the texture data across tiles. 

5. While the stretch BLT Is occurring and its results being written out to the "Extemal Memory" (green line #4), rendering 
of the next tile can begin in the pipelined engine. 

ADVANTAGES 

Using a tile-based rendering and a unified graphics cache architecture, efficient anti-aliased (via supersampling) Images 
can be created without increasing the extemal memory storage and bandwith requirements. This is achieved through the 
use of the unified graphics cache that provides a temporary storage for the supersampled image to be later filtered down 
(through streatch BLT). In this way. only the final image of the original size needs to be written out and stored in the 
extemal frame buffer. On the contrary, a non-tiled based rendering engine must first render the entire supersampled 
image to a memory surface of the size of /c times of the original screen resolution before the downsampling can occur. 
Typically, this memory is too large to be Implemented on-chip which leads to an increase in the memory storage and 
bandwidth requirements. 

This inventton Is valuable to an integrated microprocessor targeting the Value PC segment where cost Is a primary 
concern. Because of the cost constraint, the memory subsystem bandwidth is often limited due to the number of memory 
channels available. By avoiding extra memory bandwidth and storage requirements, an Integrated microprocessor can 
take advantages of the unified graphics cache and the tile-based rendering architecture to render images with full-scene 
antialiasing feature enabled with a minimal perfomnance penalty. 
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